Privacy-preserving publishing microdata with full functional dependencies
Abstract
Article history: Received 23 January 2010; received in revised form 30 October 2010; accepted 2 November 2010; available online 10 November 2010.

Data publishing has raised considerable concern about individual privacy. Recent work has shown that different kinds of background knowledge pose various threats to the privacy of published data. In this paper, we study the privacy threat posed by full functional dependencies (FFDs) when they are part of the adversary's knowledge. We show that the cross-attribute correlations induced by FFDs (e.g., Phone→Zipcode) can introduce vulnerabilities. Unfortunately, none of the existing anonymization principles (e.g., k-anonymity, l-diversity) can effectively prevent an FFD-based privacy attack. We formalize the FFD-based privacy attack and define a privacy model, (d, l)-inference, to combat it. We distinguish the safe FFDs, which do not jeopardize privacy, from the unsafe ones. We design robust algorithms that efficiently anonymize the microdata with low information loss when unsafe FFDs are present. An empirical study demonstrates the efficiency and effectiveness of our approach.

Published by Elsevier B.V.
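To make the notion of a full functional dependency concrete, the following is a minimal sketch that tests whether a dependency such as Phone→Zipcode holds in a table, and whether it is full (no proper, non-empty subset of the left-hand side already determines the right-hand side). The table contents, the attribute names other than Phone and Zipcode, and the helper names `holds_fd` and `is_full_fd` are assumptions made for illustration. The sketch only checks the textbook definition; it does not implement the paper's attack formalization, its (d, l)-inference model, or its anonymization algorithms.

```python
from itertools import combinations


def holds_fd(rows, lhs, rhs):
    """Return True if all rows that agree on `lhs` also agree on `rhs`."""
    seen = {}
    for row in rows:
        key = tuple(row[a] for a in lhs)
        val = tuple(row[a] for a in rhs)
        # Remember the first rhs value seen for this lhs value; any later
        # row with the same lhs value but a different rhs value breaks the FD.
        if seen.setdefault(key, val) != val:
            return False
    return True


def is_full_fd(rows, lhs, rhs):
    """Return True if lhs -> rhs holds and no proper, non-empty subset
    of lhs determines rhs on its own."""
    if not holds_fd(rows, lhs, rhs):
        return False
    for size in range(1, len(lhs)):
        for subset in combinations(lhs, size):
            if holds_fd(rows, subset, rhs):
                return False
    return True


# Hypothetical toy microdata, not taken from the paper.
table = [
    {"Phone": "555-0101", "Zipcode": "07030", "Age": 34, "Disease": "flu"},
    {"Phone": "555-0101", "Zipcode": "07030", "Age": 61, "Disease": "asthma"},
    {"Phone": "555-0102", "Zipcode": "07030", "Age": 34, "Disease": "diabetes"},
    {"Phone": "555-0103", "Zipcode": "07302", "Age": 45, "Disease": "flu"},
]

phone_to_zip = is_full_fd(table, ["Phone"], ["Zipcode"])
zip_to_phone = holds_fd(table, ["Zipcode"], ["Phone"])
phone_age_to_zip = is_full_fd(table, ["Phone", "Age"], ["Zipcode"])
```

In this toy table Phone→Zipcode is a full dependency, Zipcode→Phone does not hold (one zipcode maps to two phone numbers), and {Phone, Age}→Zipcode holds but is not full because Phone alone suffices.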
Similar resources
Privacy-Preserving Publishing Data with Full Functional Dependencies
Stevens Institute of Technology Hoboken, NJ, USA {hwang,[email protected]} Abstract. We study the privacy threat by publishing data that contains full functional dependencies (FFDs). We show that the cross-attribute correlations by FFDs can bring potential vulnerability to privacy. Unfortunately, none of the existing anonymization principles can effectively prevent against the FFD-based priv...
Efficient Techniques for Preserving Microdata Using Slicing
Privacy-preserving publishing is a family of techniques for applying privacy protection to large amounts of collected data. A recent and prevalent problem lies in the field of data publication. The data often contain personally identifiable information, so releasing them raises a privacy problem. Several anonymization techniques such as generalization and bucketization have been designed for priva...
A New View of Privacy in Social Networks: Strengthening Privacy during Propagation
Many smartphone-based applications need microdata, but publishing a microdata table may leak respondents’ privacy. Conventional research on privacy-preserving data publishing focuses on providing identical privacy protection to all data requesters. However, rather than staying confined to a small coterie, information usually propagates from friend to friend. The authors study the privacy-preser...
Privacy-Preserving Data Publishing in Linked Data Mashup Architectures
The mashup of microdata sources to form a data hub must fulfill a set of privacy-preserving anonymity requirements that prevent data analysts from figuring out sensitive information in the source datasets. This is relevant in a number of fields, including smart cities, electronic healthcare records, and others. Linked data publishing architectures are not designed to adapt well to the requirement...
An Effective Grouping Method for Privacy-Preserving Bike Sharing Data Publishing
Bike sharing programs are eco-friendly transportation systems that are widespread in smart city environments. In this paper, we study the problem of privacy-preserving bike sharing microdata publishing. Bike sharing systems collect visiting information along with user identity and make it public by removing the user identity. Even after excluding user identification, the published bike sharing ...
Journal: Data Knowl. Eng.
Volume: 70
Publication year: 2011